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Abstract 

A method for embedding graphs in Euclidean space is suggested. The method 
connects nodes to their geographically closest neighbors and economizes on the 
total physical length of links. The topological and geometrical properties of scale- 
free networks embedded by the suggested algorithm are studied both analytically 
and through simulations. Our findings indicate dramatic changes in the embedded 
networks, in comparison to their off-lattice counterparts, and call into question the 
applicability of off-lattice scale-free models to realistic, everyday-life networks. 
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1. Introduction 



The Internet and the World Wide Web (WWW), the electricity power grid, 
networks of flight connections, of social contacts, and neuronal networks of the 
brain are few of the many examples of networks that surround us and that 
may be usefully described as graphs [1,2,3,4,5]. 

Graph theory is rooted in the 18th century, beginning with the work of Euler. 
Early efforts focused on properties of special (and usually small) graphs. In 
the 1960s, Paul Erdos and Alfred Renyi [6,7,8] initiated the study of random 
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graphs, known also as ER graphs. The unlimited size and randomness of ER 
graphs made them natural contenders for models of large networks encoun- 
tered in everyday life. 

In 1967 the psychologist Stanley Milgram asked himself how many acquain- 
tances, on average, connect between a person on the East coast and another 
on the West coast of the USA. Following some research he finally concluded 
that the number of acquaintances is surprisingly small, just about six [9]. This 
celebrated "six degrees of separation" achieved widespread popularity. It was 
later realized that an average short path, typically of order IniV (N being 
the number of nodes, or the size of the network), connects between randomly 
selected nodes in most naturally occurring networks. Recognizing Milgram's 
contribution the effect came to be known as the small world property of net- 
works. ER graphs possess the small world property. 

However, an important ingredient was still missing. In networks of social con- 
tact the people that are connected to a certain individual (such as family, 
acquaintances at work, and friends) are highly likely to be connected among 
themselves. This high degree of clustering is absent in ER graphs. Small World 
Networks, introduced by Watts and Strogatz [11,10], exhibit high degrees of 
clustering as well as the small world property. They mark an important stage 
in the modeling of everyday networks and are studied in their own right. 

It was recently realized that in addition to a high clustering index and the small 
world property real life networks exhibit yet a third important characteristic: 
a scale-free degree distribution. The degree of node % is the number of links, fcj, 
connected to the node. The likely degree of the various nodes in most observed 
networks follows a power-law distribution: 

P(k) = ck~ x , m<k<K, (1) 

where c is an appropriate normalization factor, m is the minimal degree of 
any given node, and the cutoff degree K depends on the size of the network, 
K ~ miV 1 /^ -1 ' [12,13]. The term scale-free refers to the fact that the moments 
(k n ) for n > |_AJ do not exist (diverge), in the limit of iV — > oo. Scale-free 
networks possess the small world property, and it is possible to construct 
them with a high degree of clustering. Many important examples follow this 
pattern, including the Internet [14,15] and the WWW (both for incoming and 
outgoing links) [16,17], social networks, and virtually any large network arising 
in some natural context. See [3,4], for excellent reviews, and [5], for a timely 
anthology. 

An oft-neglected aspect in the modeling of everyday-life networks is the fact 
that they are embedded in physical (Euclidean) space and possess a geography, 
in addition to their topology. The spatial location of nodes and the length of 
the connecting links is never a consideration in the models of networks dis- 
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cussed above. Nevertheless, routers of the Internet and social networks lay on 
the two-dimensional surface of the globe; neuronal networks in brains occupy 
three-dimensional space, etc. The likelihood of connections between the nodes 
in such networks is certainly affected by their geographical proximity, and one 
expects nontrivial consequences arising from this interplay between geogra- 
phy and topology [18]. Yet, most studies so far have focused on topological 
networks, where the nodes and links exist in some abstract space, devoid of 
metric. 

In this paper we consider the embedding of networks in Euclidean space. In 
Section 1, we discuss general aspects of embedding and introduce a specific 
algorithm for the embedding of random networks of arbitrary degree distri- 
bution. This algorithm favors connections to nearest nodes and economizes 
on the total length of the links. We show that infinite networks can be thus 
embedded if and only if the degree ditribution has compact support, that is, 
provided that there exists a sharp upper cutoff K < oo for the degree of 
any given node. If the distribution has no compact support, then embedding 
introduces an artificial cutoff and is only possible in a restricted sense. In 
Section 2 we apply the embedding algorithm to the widespread case of scale- 
free networks and study key structural properties of the resulting lattices. We 
conclude with a discussion of open work and a comparison to an interesting 
related algorithm, proposed by Warren, Sander, and Sokolov [19], in Section 3. 



1 The embedding algorithm 

Our general problem is that of embedding a given infinite graph in Euclidean 
space. Suppose first that the graph contains no cycles (loops), i.e., it is a tree. 
Let the volume of a node be v. The total volume of nodes within chemical 
shell / (nodes up to / links away from a given node) increases as (k) l v, where 
(k) = J kP(k) dk is the average degree of the nodes. On the other hand, the 
volume enclosed within a Euclidean distance r scales as r d (in (^-dimensions) . 
Unless the links become progressively long, the exponential growth in / cannot 
be sustained within the much slower algebraic growth in r. If the network is 
to be statistically homogeneous, a lengthening of successive links is unaccept- 
able. We conclude that infinite trees, such as the Cayley tree (Fig la), are not 
embeddable in Euclidean space [20]. Allowing for loops, the volume constraint 
is reduced to the extent that embedding might be possible while retaining 
homogeneity, and without a change in the degree distribution. An example is 
shown in Fig lb. 

In order to make further headway we restrict ourselves to random graphs, of ar- 
bitrary degree distribution P{k), and the following embedding algorithm [21]. 
To each site % of a d-dimensional lattice assign a random connectivity ki, taken 
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Fig. 1. Embedding problem. The Cayley tree (a) is a regular graph (all nodes have 
the same connectivity, k = 3) without loops. It cannot be embedded in Euclidean 
space, unless one allows for loops: The hexagonal lattice (b) is one example of 
embedding achieved in this way. Notice that the degree distribution remains un- 
changed. 

from the degree distribution P{k). Select site j at random and connect it to 
its closest neighbors, until its preassigned connectivity kj is realized, or until 
all sites within distance 



have been explored. A link to site I is allowed provided that: (a) the site is 
not saturated (its connectivity has not yet reached the preassigned ki), and 
(b) its distance from site j is smaller than Ak\^ d . Repeat for all sites. This 
algorithm makes sense in the context of social networks, where connections are 
typically confined to one's immediate neighborhood. The implied economy in 
the physical length of links might render the algorithm useful for the modeling 
of other networks of interest. 

Suppose that one attempts to embed an infinite network, of degree distribution 
P(k), in an infinite lattice, by the above algorithm. Nodes with a connectivity 
larger than a certain cutoff k c (A) cannot be realized, because of the possible 
saturation of surrounding sites. Consider the number of links n(r) entering a 
generic node from a surrounding neighborhood of radius r. Sites at distance 
r' are linked to the node with probability 



where Sd is the surface area of the <i-dimensional unit sphere. Reversing the 
order of integration and carrying out the spatial integral, we obtain 




(2) 



poo 

Pr(k > (r'/A) d ) = / P{k')dk 



J(r'/A) d 



Thus, 
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where V d = is the volume of the rf-dimensional unit sphere. Finally, taking 
the limit r — > oo, we derive the cutoff 



k c = lim n(r) = V d A d 



(k). 



(3) 



We distinguish between two cases for the degree distribution P(k): with and 
without compact support. The distribution P(k) has compact support if there 
exists a K < oo such that P(k) = for all k > K. All finite networks have a 
distribution with compact support, however infinite networks might too have 
compact support (an example is the Cayley tree of Fig. la). If this is the 
case, then (k) < K is finite, and one can always select A large enough so that 
k c > K. In other words, if there is compact support the network is embeddable. 

If the network is infinite and without compact support, then if (k) < oo, for 
any finite A the cutoff k c is finite and the tail of the distribution for k > k c 
is chopped off. (Infinite networks with diverging (k) are pathological — we are 
not aware of any important practical application — though they might be 
embeddable.) The cutoff connectivity k c implies a cutoff length 



The embedded network displays the original (chopped) distribution up to 
lengthscale £ and repeats, statistically, at lengthscales larger than £. Indeed, £ 
is analogous to the correlation length in percolation theory, above criticality, 
where the infinite cluster is fractal for r < £ and is homogeneous for r > 
£ [22,23,24]. We emphasize that the cutoff k c is of consequence even if the 
distribution P(k) is narrow, that is, even if f^ P{k) dk -C 1. Indeed, even in 
such a case the lengthscale £ is finite (and controlled mainly by A). 

In summary, finite networks (all practical situations) are always embeddable 
by ourproposed algorithm. Infinite networks are strictly embeddable only if 
their degree distribution has compact support. Otherwise, they are embed- 
dable in a restricted manner, with a cutoff k c imposed by the embedding, and 
statistical repetition at lengthscales r > £ ~ A 2 . 



i = r{k c ) = {V d {k)) l / d A 2 . 



(4) 
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2 Embedding of scale-free networks 



We now apply the embedding algorithm to the widespread case of scale-free 
networks, Eq. (1). Generally, we consider embedding in rf-dimensional lattices 
of size R, though in the numerical simulations shown below we limit ourselves 
to two-dimensional square lattices with periodic boundary conditions. Because 
the lattice has a finite number of sites, N ~ R d , the degree distribution (1) 
has compact support [12,13]: 

K ~ miV 1 /^ 1 ) ~ tfW- 1 ) . (5) 

This, in conjunction with (2), implies a natural cutoff length 

r max = r(K) ~ AflVd-D . ( 6 ) 



The interplay between the three lengthscales, R, £, r max , determines the nature 
of the network. The embedding cutoff k c is imposed only if £ < r max , and there 
is statistical repetition for r > £. Otherwise (r max < £), the natural cutoff of K 
is attained. As long as min(r max ,£) <C i?, the finite size of the lattice imposes 
no serious restrictions. Otherwise (min(r max , £) > R), finite-size effects become 
important. In short, there exist 6 different regimes, characterized by the rela- 
tive ordering of R, £, and r max : Regime A: r max </?<£. Natural cutoff K is 

attained, and no finite-size effects. Statistical repetition occurs at lengthscales 
f > fmax, rather than £. Regime B: r max <£</?. Same as regime A, but 

statistical repetition occurs at lengthscales r > £. Regime C: £ < r max < R. 

Cutoff k c is imposed; nofinite-size effects. Regime D: £ < R < r max . Same as 

regime C. Regime E: R < £ < r max . Cutoff /c c is imposed; strong finite-size 

effects prevent statistical repetition. Regime F: R < r max < £. Natural cutoff 

K is attained; strong finite-size effects. The various regimes are demarcated 

by the lines 
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(7) 



as shown in Fig. 2. In Fig. 3a, we present typical networks that result from 
our embedding method, for I = 2.5 and I = 5. Long range links become 
more noticeable as / decreases. In Fig. 3b, we show the same networks as in 
Fig. 3a, where successive chemical shells are shaded differently. Chemical shell 
/ consists of all sites that can be reached by a minimal number of / connecting 
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Fig. 2. Regimes of embedded scale-free networks. A: r max < R < £, B: r max < £ < R, 
C: £ < r maa; < i?, D: £ < i? < r ma:E ,E: i? < £ < r ma:E , F: i? < r ma:E < f . The diagram 
can be reduced into just four regimes separated by the cutoff k c and by finite-size 
effects. A and B: no cutoff and no size effect; C and D: cutoff and no size effect; 
E: cutoff and size effect; F: no cutoff but size effect. The two symbols indicate the 
parameters corresponding to Fig. 3b, A = 2.5 (diamond) and A = 5 (circle). 

links relative to a given site (the central site, in the figure). For our choice of 
parameters, / = 5 falls in the region of £ > r max , while for I = 2.5, £ < r max . In 
the latter case, we clearly see the statistical repetition beyond the lengthscale 
f (Fig. 3b, / = 2.5). 

The degree distribution of the embedded networks is illustrated in Fig. 4. 
In Fig. 4a, various values of A were selected such that £ < r max , and the 
distribution terminates at the imposed cutoff k c . The scale-free distribution is 
altered slightly, for k < k c , due to saturation effects, but the overall trend is 
highly consistent with the original power-law. The data-collapse shown in the 
inset confirms the power-law, as well as the scaling of the cutoff k c ~ A d . In 
Fig. 4b, A = 10 was fixed and values of I were selected such that £ > r max . The 
natural cutoff K is now attained, and the data collapse at the inset confirms 
the power-law distribution as well as the known relation K ~ mR d ^ l ~ 1 \ 

We now turn to the relation between the Euclidean metric and chemical length 
in our embedded scale-free networks. The chemical distance I between two sites 
is the minimal number of links I connecting the two. Thus, if the Euclidean 
distance between the sites is r, then 

I ~ r dmin (8) 

defines the minimal (chemical) length exponent d m i n . In order to compute d m { n 
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A = 2.5 



A = 5 




Fig. 3. Embedded scale-free networks, (a) Actual networks (links are highlighted) for 
graphs with I = 2.5 and 5, embedded in 50 x 50-sites square lattices, (b) Chemical 
shells, for the same parameters as in (a), but for lattices of size 300 x 300. Notice 
the statistical repetition at lengthscales greater thanf;, for I = 2.5. 

we regard the chemical shells as being roughly smooth, at least in the regime 
£ > '"max, as suggested by Fig. 3b (I = 5). Let the width of shell I be Ar(l), 
then 

1= [dl= [ ~ r d ^ , (9) 



Ar(l) 

since Al = 1. The number of sites in shell I, N(l), is, on the one hand, 
N(l) ~ r(l) d ~ l Ar(l). On the other hand, since the maximal connectivity in 
shell / is K{1) ~ Nilf/^, the thickness of shell (/ + 1) is Ar(l + 1) ~ 
r(K(l)) ~ AK{l) l l d . Assuming (for large I) that Ar(l + 1) ~ Ar(l), we obtain 

Ar(l) ~ r (rf-D/W-D-i] . (io) 
Using this expression in (9) yields 

1-2 

dmin - l-i-i/d • (H) 
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Fig. 4. Degree distribution of embedded scale-free networks, (a) when the cutoff k c 
is imposed, and (b) when the natural cutoff K is achieved. For (a) the lattice size 
is R = 400, the distribution exponent is / = 2.5, and A = 2 (circles), 3 (squares), 
4 (diamonds). The inset confirms the scaling of Eq. (3). For (b) R = 100, A = 10, 
and I = 2.5 (circles), 3.0 (squares), 5.0 (diamonds). The inset confirms the scaling 
of Eq. (5). 
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Fig. 5. Minimal length exponent d m i n . (a) d m in vs. /. The analytic result of Eq. (11) 
(curve) matches nicely the results measured from simulations (squares), (b) Data 
colapse for the scaling function &(l dl / R d ), for A = 4 and lattice sizes R = 1000 
(circles), 2000 (squares), 2500 (diamonds) and 3000 (triangles). 

Thus, for d > 1, the dimension d m \ n < 1. This result is opposite to all known 
disordered media, where d min > 1. (A particularly simple example is provided 
by a polymer chain, or its self-avoiding-walk model, where the end-to-end 
distance is well-approximated by the Flory relation: r ~ / 3 /( d + 2 ) ; j.e.,d m m = 
(d + 2)/3.) We have also computed the fractal dimension of the newtorks and 
found that the interior of the Z-clusters is compact, with df — d [21]. (Fig. 3 
suggests that the hull of the clusters is fractal.) This resultwas confirmed 
by simulations. It then follows that the fractal dimension of the network in 
chemical space is also anomalous: di = df/d min = d/d m \ n > d (for d > 1). 



In Fig. 5a, we plot d m i n as measured from simulations, and compared with 
the analytical result of Eq. (11). The scaling suggested in Fig. 5b, N(l) ~ 
/ d i _1 $(/ d i /R d ), is valid only for £ > r max . For R — >• oo, we expect that the 
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network is scale-free up to lengthscale £ and the analogous scaling would 
be N(l) ~ Z d, - 1 tf(Z < Vf d ), where > 1) ~ x^W*. It remains an open 
challenge to conceive of a general scaling relation that would encompass all of 
the regimes A - F. 



3 Discussion 

In summary, we propose an algorithm for the embedding of networks in Eu- 
clidean space. Finite networks can always be embedded in this way. Infinite 
networks are strictly embeddable only if the degree distribution has compact 
support. If not, an artificial cutoff k c results from our embedding technique, 
and the networkrepeats itself, in a statistical sense, at lengthscales greater 
than £(/u c ). We have applied our embedding algorithm to scale-free graphs, 
and studied the resulting networks. 

Concurrently with our work (and independently), Warren, Sander and Sokolov [19] 
proposed an embedding algorithm very similar to ours. The main difference 
in their approach is that connecting to saturated sites is allowed, and a node 
is connected to as many of its closest neighbors as necessary, until its target 
connectivity is fulfilled. In this case, our computation of k c in Section 1 is still 
valid, only that now it is a lower cutoff for the connectivity of the sites. On 
the other hand, there is no restriction, in this version of embedding, on the 
upper cutoff of the distribution. We anticipate that £ here plays the role of a 
'geometrical' lengthscale, similar to the lattice spacing for a critical percola- 
tion cluster grown on a lattice. (The cluster is self-similar only at lengthscales 
larger than the lattice spacing.) 

Embedding scale-free networks in Euclidean space results in some dramatic 
changes from the original graphs. Consider, for example, the small world prop- 
erty, common to off-lattice scale-free networks grown by any of the known 
techniques. The small world property disappears at lengthscales greater than 
£: two sites separated by a physical distance r = n£, n >> 1, would be con- 
nected by at least n intervening links, due to the statistical repetition. The 
number of sites withinradius r is N ~ n d ^ d , so / ~ N 1 ^. Not surprisingly, 
this is the same scaling as for Euclidean d- dimensional lattices. For distances 
r < £, wehave I ~ r^" ~ j\rd min /<^ wri i cri represents small-world behavior, 
since it improves on the Euclidean I ~ N 1 ^, but a far cry from the off-lattice 
I ~ In N. Another example of a dramatic change is the qualitative difference 
between percolation inoff-lattice and embedded networks. Indeed, Warren et 
al., [19] find that the percolation transition takes place in embedded lattices 
even for / < 3, in contrast to off-lattice scale-free networks [3,12]. In view of 
the fact that many everyday-life networks are embedded in physical space, 
we ought to reconsider how the topological properties of scale-free graphs - 
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properties that we normally attribute to the physical networks — are affected 
by the embedding. 
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